Efficiency Enhancement of 
Probabilistic Model Building Genetic Algorithms 



Kumara Sastry 
David E. Goldberg 
Martin Pelikan 



IlliGAL Report No. 2004020 
February, 2004 



Illinois Genetic Algorithms Laboratory (IlliGAL) 
Department of General Engineering 
University of Illinois at Urbana- Champaign 

117 Transportation Building 
104 S. Mathews Avenue, Urbana, IL 61801 



Efficiency Enhancement of Probabilistic Model Building Genetic 

Algorithms 



Kumara Sastry 
Illinois Genetic Algorithms Laboratory (IlliGAL), and 
Department of Material Science & Engineering 
University of Illinois at Urbana- Champaign 
ksastry@uiuc . edu 

David E. Goldberg 
Illinois Genetic Algorithms Laboratory (IlliGAL), and 
Department of General Engineering 
University of Illinois at Urbana- Champaign 
deg@uiuc . edu 

Martin Pelikan 
Department of Math & Computer Science 
University of Missouri, St. Louis 
mpelikan@cs.umsl . edu 



Abstract 

This paper presents two different efficiency-enhancement techniques for probabilistic model 
building genetic algorithms. The first technique proposes the use of a mutation operator which 
performs local search in the sub-solution neighborhood identified through the probabilistic 
model. The second technique proposes building and using an internal probabilistic model of 
the fitness along with the probabilistic model of variable interactions. The fitness values of 
some offspring are estimated using the probabilistic model, thereby avoiding computationally 
expensive function evaluations. The scalability of the aforementioned techniques are analyzed 
using facetwise models for convergence time and population sizing. The speed-up obtained by 
each of the methods is predicted and verified with empirical results. The results show that for 
additively separable problems the competent mutation operator requires 0{-\fk\ogm) — where 
k is the building-block size, and m is the number of building blocks — less function evaluations 
than its selectorecombinative counterpart. The results also show that the use of an internal 
probabilistic fitness model reduces the required number of function evaluations to as low as 
1-10% and yields a speed-up of 2-50. 



Introduction 



A key challenge in genetic and evolutionary computation (GEC) research is the design of competent 
genetic algorithms (GAs). By competent we mean GAs that can solve hard problems, quickly, 
reliably, and accurately, and much progress has been made along these lines ( Goldberg, 1 999 



Goldberg, 2002 ). In essence, competent GA design takes problems that were intractable with first 



1 



generation GAs and renders them tractable, oftentimes requiring only a subquadratic number of 
fitness evaluations. However, for large-scale problems, the task of computing even a subquadratic 
number of function evaluations can be daunting. This is especially the case if the fitness evaluation 
is a complex simulation, model, or computation. This places a premium on a variety of efficiency 
enhancement techniques. In essence, while competence leads us from intractability to tractability , 
efficiency enhancement takes us from tractability to practicality. 

In this paper, we propose two different efficiency-enhancement techniques for competent ge- 
netic algorithms in general, and probabilistic model building genetic algorithms in particular: 
(1) Using a competent mutation operator that performs local search in building-block neighbor- 
hoods identified by the probabilistic model of variable interactions ( Sastry Hi Goldberg, 20 04b 



Sastry Hi Goldberg, 2004a| ), and (2) Building and Using an internal probabilistic model of fitness in- 



stead of the more expensive fitness function ( Sastry, Pelikan, Hi Goldberg, 2004] Pelikan Hi Sastry, 2004 ) . 



We develop facetwise models to predict the scalability and speed-up of both efficiency-enhancement 
techniques. Specifically, we summarize and combine the analysis and results presented elsewhere 
( Sastry Hi Goldberg, 200"4bl|Sastry Hi Goldberg, 2004a| [Sastry, Pelikan, Hi Goldberg, 20041 |Pelikan Hi Sastry, 2004 



This paper is organized as follows. The next section gives a brief introduction to extended 
compact genetic algorithm (eCGA), followed by a description of a scalable mutation algorithm. We 
analyze the scalability of the mutation algorithm and the speed-up obtained by using it. Section 
discusses evaluation relaxation in PMBGAs by building and using an internal probabilistic model 
of fitness. We analyze the scalability and the speed-up of the evaluation-relaxation scheme. Finally, 
we outline future research directions followed by conclusions. 

2 Extended Compact Genetic Algorithm (eCGA) 

The extended compact GA proposed by Harik (Harik, 1999), like traditional genetic algorithms, is 



a selectionist search method, where only a subset of better individuals influence the subsequent gen- 
eration of candidate solutions. However, like other probabilistic model building genetic algorithms 
(PMBGAs) flPelikan, Lobo, Hi Goldberg, 2002 1, eCGA replaces the traditional variation operators 



of genetic algorithms by building a probabilistic model of promising solutions and sampling the 
model to generate new candidate solutions. 

The probabilistic model in eCGA is represented by a class of probability models known as 
marginal product models (MPMs). MPMs are formed as a product of marginal distributions on 
a partition of the genes. MPMs also facilitate a direct linkage map with each partition separat- 
ing tightly linked genes. For example, the following MPM, [1,3] [2] [4], for a four-bit problem 
represents that the 1 st and 3 rd genes are linked and 2 nd and 4 th genes are independent. 

In eCGA, both the structure and the parameters of the model are searched and optimized to 
best fit the data. To distinguish between better model instances from worse ones, eCGA uses 
a minimum description length (MDL) metric as the class-selection metric. In essence, the MDL 
metric penalizes both complex as well as inaccurate models. The MDL metric used in eCGA is a 
sum of model complexity, C m , and compressed population complexity, C p . In essence, the model 
complexity, C m , quantifies the model representation size in terms of number of bits required to 
store all the marginal probabilities. Let, a given problem of size I with binary alphabets, have m 
partitions with k{ genes in the i th partition, such that Y^ZLi ki = £■ Then each partition i requires 
l\ — \ independent frequencies to completely define its marginal distribution. Furthermore, each 
frequency is of size log 2 (ra), where n is the population size. Therefore, the model complexity C m , 



2 



(71 



is given by 

C m = log 2 (n)^(2^-l). (1) 
i=i 

The compressed population complexity, C p , represents the cost of using a simple model as 
against a complex one. In essence, the compressed population complexity, C p , quantifies the data 
compression in terms of the entropy of the marginal distribution over all partitions. Therefore, C p 
is evaluated as 

m 2 k i 

C P = 71 mi ~Pi3 lo S2 (Pij) ( 2 ) 
i=l j=l 

where is the frequency of the j th gene sequence of the genes belonging to the i th partition. 
In other words, pij = Hij/ri, where mj is the number of chromosomes in the population (after 
selection) possessing bit-sequence j S [l,2 fci ] for i th partition. 

The MDL metric is used to evaluate alternative probabilistic models (chosen from admissible 
MPMs). Similar to other PMBGAs, eCGA uses a greedy search heuristic is used to find an optimal 
model of the selected individuals in the population. The greedy-search method begins with models 
at a low level of complexity (all independent variables), and then adding complexity when it locally 
improves the MDL metric value. This process continues, until no further improvement is possible. 

Once the best probabilistic model is built, the new individuals are created by sampling the 
probabilistic model. The offspring population are generated by randomly choosing subsets from 
the current individuals according to the probabilities of the subsets as calculated in the probabilistic 
model. 

The population-sizing and the scalability of PMBGAs in general, and the Bayesian optimization 
algorithm and eCGA in particular, have been studied elsewhere ( Pelikan, Goldberg, fe; Cantu-Paz, "2 000a; 



Pelikan, Sastry, fc Goldberg, 2003| Sastry Goldberg, 2004a| ). The models predict that the pop- 



ulation size required to solve a problem with m building blocks (BBs) of size k with a failure rate 
of a = 1/m is given by 

n oc 2 k 1 mlogm, (3) 

and the number of function evaluations is given by 



rife oc 



V d 



Vk- 2 fc m L5 logm, (4) 



where o~bb is fitness- variance of a BB and d is the signal difference between competing BBs 
(Goldberg, Deb, Hi Clark, 19921. In other words, the models predict that for additively separa- 



ble problems, eCGA scales subquadratically, 0(2 fc m L5 log m), with problem size. 



3 Probabilistic Model Building BB-wise Mutation 

As explained in the previous section, eCGA builds marginal product models that yields a direct 
mapping of linkage groups among successful individuals. The probabilistic model yields a global 
neighborhood information and can be effectively exploited by a mutation operators that performs 
local search in the building-block neighborhood. In other words, we can replace a bit-wise muta- 
tion method that scales exponentially, by an an enumerative BB-wise mutation operator as used 
elsewhere ( Sastry &"G oldbcrg , 2004b| ), which scales subquadratically with problem size. 
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The BB-wise mutation uses the best individual, and searches for the best building block for 
each linkage-group identified by the MPM. For example, if model builder identifies m variable- 
groups with k variables in each group, the BB-wise algorithm will select the best BB out of 2 k 
possible ones in each of the m partition. Note that the performance of the BB-wise mutation 
can be slightly improved by using a greedy heuristic to search for the best among competing 
BBs, however, as shown later, the scalability of the probabilistic model building BB-wise mutation 
operator is determined by the population-size required to accurately identify the building blocks. 

It should be noted that while eCGA can only build linkage groups with non-overlapping vari- 
ables, the mutation procedure can be easily used with other linkage identification techniques that 
can handle overlapping BBs such as BOA (Pelikan, Goldber g, Hi Cantu-Paz, 2000b I or DSMDGA 
(Yu, Goldberg, Yassine, Hi Ghen, 2003|). However, since the effect of overlapping interactions be- 



tween variables is similar to that of an exogenous noise ((Goldberg, 2002), crossover is likely to be 
more effective than mutation (Sa stry Hi Goldberg, 2004b ). 

Moreover, we perform linkage identification only once in the initial generation. This offline 
linkage identification works well on problems with BBs of nearly equal salience. However, for 
problems with BBs of non-uniform salience, we would have to perform linkage identification and 
update BB information in regular intervals. Furthermore, it might be more efficient to uti- 
lize both BB-wise mutation and eCGA model sampling simultaneously or sequentially along the 
lines of hybridization ( |Goldberg Hi Voessner, 1999} Sinha Hi Goldberg, 2003| Sinha, 2003 1 and time- 
continuation ( Goldberg, 1999[ |Srivastava, 2002| ) techniques. 

To reiterate, the objective of this paper is to couple linkage identification with a mutation 
operator that performs local search in the BB neighborhood and to verify its effectiveness in solving 
boundedly difficult additively separable problems. Moreover, the aforementioned enhancements can 
be designed on the proposed competent selectomutative GA. 



3.1 Scalability of the BB-wise Mutation 

As mentioned earlier, eCGA without mutation scales as 0(2 k m 1 ' 5 logm). Here, we consider the 
scalability of the selectomutative GA, which depends on two factors: (1) The population size 
required to build accurate probabilistic models of the linkage groups, and (2) the total number of 
evaluations performed by the BB-wise mutation operator to find optimum BBs in all the partitions. 

Pelikan and Sastry fPelikan, Sastry, Hi Goldberg, 2003) observed that to build accurate models 
the population size has to be scaled as, 



O (2 k m im 



<n< 



O {2 k m 2A ) 



(5) 



Since we perform the model building only once, the total number of function evaluations scales as 
the population size. That is, 



O (2 fc m L05 ) < n fe) i < O (2 k m 2A 



(6) 



During BB-wise mutation, we evaluate 2 k — 1 individuals for determining the best BBs in each 
of the m partitions. Therefore, the total number of function evaluations used during BB-wise 
mutation is 



«fe,2 



(2 k - l) m = O (2! 



in 



(7) 



4 



cT 10 
en 
q 
13 

CO 

> 

o 10 : 
o 



10' 




O k = 4, d = 0.25:O(m 1 ' 4 ) 
k = 5, d = 0.20:O(m 14 ) 



4 5 7 10 20 30 

Number of building blocks, m 



(a) Scalability 



50 



^ 6 

c£ 

z> 

1 5 

CD 
CD 

a. „ 
CO 4 



O k = 4, d = 0.25: O(log(m)) 
k = 5, d = 0.20: O(log(m)) 


<>>' 









,r'<> 














o ^s-^ 




o 





3 5 7 10 

Number of building blocks, m 



(b) Speed-Up 



20 



Figure 1: Empirical verification of the scalability and speed-up fEauationllUj) obtained by using the 
probabilistic model building BB-wise mutation over eCGA for the m k-Trap function. The results 
show that the BB-wise mutation scales as (P(2 fc m 1,5 ) and the speed-up scales as 0{yk\ogm). 



From Equations El and the total number of function evaluations scales as 

O (2 fe m 105 ) <n {0 <O (2 k m 2A ) . (8) 
Indeed, we empirically observed ( |Sastry &; Goldberg, 2004a| ) that the number of function evalua- 

(9) 



tions scales as (see Figure 1(a) I 

n fe = 0{2 k m 1 - 5 ). 

The results (Equations 21 and EJ) indicate that the selectomutative algorithm is O {^/klogm 
faster than eCGA in solving boundedly difficult additively separable problems. That is, the speed- 
up — which is defined as the ratio of number of function evaluations required by eCGA to that 
required by the selectomutative GA — is given by 



V 



n fe (eCGA) 



rif c (BBwise Mutation) 



O (v^log 



m 



(10) 



Empirical results shown in Figure 1(b) agrees with the above equation. The results show that the 
probabilistic model building BB-wise mutation is 0(\fkm) times faster than the extended compact 
GA. 



4 Evaluation- Relaxation in PMBGAs 

Evaluation-relaxation schemes are efficiency-enhancement techniques, where an accurate, but computationally- 
expensive fitness evaluation is replaced (or augmented) with a less- accurate, but computationally 
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less expensive fitness evaluation (Sastry, 2001). The low-cost, less-accurate fitness estimate can 



either be (1) Exogenous, as in the case of surrogate (or, approximate) fitness functions, where an ex- 
ternal means is used to develop the fitness estimate (For example, see ( |Barthelemy Haftka, 19 93)), 



or (2) Endogenous, as in fitness inheritance (Smith, Dike, fe Stegmann, 1995), where, the fitness 



estimate is computed internally based on parental fitnesses. 

While the use of exogenous models have been extensively — both empirically and analytically — 
studied (see ( Sastry, 2001 ) and ( Jin, 2003 ) and references therein) , limited attention has been paid 



towards analysis and development of competent methods for building endogenous fitness estimates. 

The endogenous models used in evolutionary-computation literature are naive ( Smith, Dike, k, Stegmann7*1 995 ; 



Zheng, Julstrom, Cheng, 19 97) and analytically have been shown to yield limited speed-up of 
about 1.25, both in single-objective dSastry, Goldberg, fc Pelikan, 2001] ) , and in multiobjective 
cases ( Chen, Goldberg, Ho, &, Sastry, 2002| ). In this study, we develop an efficient and effective 



endogenous probabilistic fitness model, that automatically and adaptively exploits the regulari- 
ties of the search problem. We show that the speed-up obtained by the use of such a model for 
estimating the fitness of some offspring, yields significant speed-up. 

Similar to earlier studies on fitness inheritance, in the proposed method, all the individuals 
in the initial population are evaluated and subsequently a portion of the offspring population 
receives inherited fitness and the other receive actual fitness evaluation. That is, an offspring 
receives inherited fitness with a probability pi, or an evaluated fitness with a probability 1 — pj. 
However, unlike previous works, which used either average or weighted average of parental fitnesses 
as the inherited fitness, here we employ the probabilistic model built by eCGA and estimates of 
linkage-group fitnesses in determining the inherited fitness. Specifically, individuals from parental 
population that received evaluated fitnesses (that is, individuals whose fitnesses were not estimated) 
are used to determine the fitnesses of schemata that are defined by the probabilistic model. The 
schema fitness from different partitions are then used to estimate the fitness of an offspring. The 
procedure is detailed in the following paragraph. 

After the probabilistic model is built and the linkage map is obtained (step 4 of the eCGA 
algorithm outlined in the previous section), we estimate the fitness of schemata using only those 
individuals whose fitnesses were not inherited. In all, we estimate the fitness of a total of YaLi 2 fci 
schemas. Considering our previous example (Section [2j) of a four-bit problem, whose model is 
[1,3] [2] [4], the schemata whose fitnesses are estimated are: {0*0*, 0*1*, 1*0*, 1*1*, *0**, 
***0, ***l}. 

The fitness of a schema, h, is defined as the difference between the average fitness of individuals 
that contain the schema and the average fitness of all the individuals. That is, 

1 1 n ' 

h {i\xiDh} i=l 

where is the total number of individuals that contain the schema h, xi is the i th individual and 
f{xi) is its fitness, n' is the total number of individuals that were evaluated. If a particular schema 
is not present in the population, its fitness is arbitrarily set to zero. Furthermore, it should be 
noted that the above definition of schema fitness is not unique and other estimates can be used. 
The key point however is the use of the probabilistic model in determining the schema fitnesses. 

Once the schema fitnesses across partitions are estimated, the offspring population is created 
as outlined in Section [21 An offspring receives inherited fitness with a probability pi , referred to as 
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the inheritance probability. The inherited fitness is computed as follows: 



-. n m 

/mh (y) = - E / fa) + E fs ( h i e v) (12) 

1=1 1=1 

where y is the offspring individual. It should be noted that the eCGA model yields non-overlapping 
linkage groups and might not be appropriate for problems with overlapping BBs. However, similar 
concepts can be incorporated in other PMBGAs such as the Bayesian optimization algorithm 
(BOA) (Peli kan, Goldberg, Oantu-Paz, 2000b 1 which can handle overlapping BBs. Moreover, 



the inherited fitness can be computed in other manner, but the key is to use the estimates of 
substructure fitnesses in the computation. 

With this understanding of the inheritance mechanism, we will now model the effects of fitness 
inheritance on the scalability of the GA and to predict the speed-up (or efficiency enhancement) 
obtained through fitness inheritance in the following section. 

4.1 Scalability of Using the Internal Fitness Model 

Elsewhere, facetwise models have been developed for analyzing the effect of using the probabilistic 

fitness model on the population sizing and convergence time for a GA success ( Sastry, Pelikan, Goldberg, 2 004) 

The population-sizing model is given by 

n = -c n log(a)2 k a 2 f (l+ Pi ), (13) 

where n is the population size, c n is a problem-dependent constant, k is the BB length, a is the 
probability of failure, and a 2 is the fitness variance. The convergence-time model is given by 



t c = CtV m • 



a 



1 + ^ (14) 



where q is a problem dependent constant. 

We now use the convergence-time and population-sizing models to predict the number of func- 
tion evaluations required for eCGA success: 



n 



fe = n + n(t c -l)(l-pi). (15) 



Recall that all the individuals in the initial population are evaluated and there after on an average 
n(l — Pi) individuals are evaluated. 

To isolate the effect of fitness inheritance on the scalability of eCGA, we consider the ratio of 
total number of function evaluations required with fitness inheritance and that required without 
fitness inheritance. That is, we consider the function-evaluation ratio, nt e ^ r = nj e /nf e {pi = 0). 
From Equations 1131 and 1141 we obtain the following approximation for rif er : 

n /ejr «(l+ Pi ) L5 (l- K )- (16) 

The speed-up that can be obtained through fitness inheritance is given by the inverse of the 
function-evaluation ratio: 

= d+B)" (!-„)■ <17> 
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(a) Verification of function-evaluation-ratio model 



(b) Verification of speed-up model 



Figure 2: Verification of the function-evaluation-ratio model (Equation I16|) and the speed-up model 
(Eauation ll7j) with empirical results on 100-bit OneMax and 10 4-Trap problems. The total number 
of function evaluations is determined such that the failure probability of an eCGA run is at most 
1/m. The results are averaged over 900-3000 independent runs. 



Equation indicates that the function-evaluation ratio increases (or the speed-up reduces) at 
low pi values, reaches a maximum at pi = 0.2. When pi = 0.2 the number of function evaluations 
required is 5% more than that required without inheritance. In other words, the speed-up at 
Pi = 0.2 is 0.95. For inheritance probabilities above 0.2 the function-evaluation ratio decreases 
(speed-up increases) with the inheritance probability. Equation El predicts that the speed-up is 
maximum whenpj = 1.0, however, it should be noted that the models derived are not entirely valid 
for higher pi values (pi > 0.95). While the fitness model built on eCGA requires fitness evaluation 
for about 10% of the population, yielding a speed-up of about 1.8-2.25, the fitness model built on 
BOA requires fitness evaluation for about only 1% of the population, yielding a speed-up of about 
35-50 flPelikan fe Sastry, 20041 ). 

The predicted values of function-evaluation-ratio (Equation HE]) and the speed-up (Equation fTTI 
are verified with empirical results for OneMax and m k-Trap in Figures 2(a) and |2(b)"| As shown 
in Figures 2(a) and |2(b)[ the empirical results agree with the analytical models. Furthermore, 
the agreement for the OneMax problem with the models is good even though the building-block 
identification for the OneMax problem is only partially correct. The results show that the required 
number of function evaluations is almost of halved with the use of fitness inheritance thereby leading 
to a speed-up of 1.75-2.25. This is a significant improvement over a speed-up of 1.25 observed for 
simple GAs with a simple inheritance mechanism. Furthermore, fitness inheritance yields speed-up 
even when the inheritance probabilities is very high, even as high as 0.85-0.99, which is similar 
to the empirical observation of Smith, Dike, and Stegmann (Smit hTDike, fc Bte gmann, 1995Q , As 
mentioned earlier, when we used the Bayesian optimization algorithm instead of eCGA, we observed 
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that fitness evaluation was required only for 1% of the population, yielding a speed-up of about 
35-50. 

Overall, the results suggest that significant efficiency enhancement that can be achieved through 
an inheritance mechanism that incorporates knowledge of important sub-solutions of a problem and 
their partial fitnesses. 



5 Future Work 

We demonstrated two efficiency-enhancement techniques for probabilistic model building genetic 
algorithms that provide significant speed-up. The first method demonstrated the potential of 
inducting global neighborhood information into mutation operations via the automatic discovery of 
linkage groups by probabilistic model building techniques. The second method illustrated the use 
of an endogenous probabilistic fitness model for estimating fitness of some individuals instead of 
the computationally expensive fitness evaluation. The results are very encouraging and warrants 
further research in the following avenues: 

Hybridization of competent crossover and mutation: While we considered a bounding case 
of crossover vs. mutation, it is likely to be more efficient to use an efficient hybrid of competent 
crossover and mutation operators. For example, we can consider a hybrid GA with oscillating 
populations. A large population is used to gather linkage information and used for crossover, 
while a small population is used for searching in BB neighborhood. 

Problems with overlapping building blocks: While this paper considered problems with non- 
overlapping building blocks, many problems have different building blocks that share common 
components. The performance of probabilistic model building BB-wise mutation on problems 
with overlapping building blocks have to be analyzed. Since the effect of overlapping vari- 
able interactions is similar to that of exogenous noise ( IGoldberg, 20021 ), based on our recent 



analysis (Sastry & Goldberg, 2004b), a crossover is likely to be more useful than mutation. 



Moreover, while considering problems with overlapping building blocks, the use of eCGA 
might not be appropriate, however the fitness model building mechanism should still be valid 
which can be used in other more sophisticated PMBGAs such as the Bayesian optimization 
algorithm (Pelikan & Sastry, 2004). 



Problems with non-uniform BB salience: In this paper we considered additively separable 
problems with uniform sub-solution salience. Unlike uniformly-scaled problems, in non- 
uniformly scaled problems BBs are identified sequentially over time. Therefore, in such cases, 
we would need to regularly update the BB information and develop theory to predict the 
updating schedule. The effect of non-uniform building-block salience on the speed-up and 
optimal inheritance proportion should also be investigated. 

Hierarchical problems: One of the important class of nearly decomposable problems is hierar- 
chical problems, in which the building-block interactions are present at more than a single 
level. Further investigation is necessary to analyze the performance of BB-wise mutation on 
hierarchical problems. Additionally, the fitness model building mechanism used in this study 
could be enhanced and incorporated into hBOA and the efficiency enhancement provided by 
inheritance can also be investigated. 
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Additional dimensions of problem difficulty: In this paper we considered one of the dimen- 
sions of GA problem difficulty, deception. However, there are other dimensions of problem 
difficulty (Goldberg, 2002) such as epistasis and external noise. This factors should be in- 



cluded in isolation or in conjunction with other factors of problem difficulty in determining a 
complete picture of efficiency enhancement provided by fitness inheritance. 

Real- World problems: One of the key objectives of analyzing and developing fitness- inheritance 
mechanism is to aid the principled incorporation of such an mechanism in competent genetic 
and evolutionary algorithms for successfully solving complex real-world problems in practical 
time. 



6 Summary & Conclusions 

In this paper, we proposed two efficiency-enhancement techniques for probabilistic model building 
genetic algorithms. The first method is a scalable mutation operator, which performs local search 
in the neighborhood dictated by the probabilistic model. The second method is an evaluation- 
relaxation scheme, where an endogenous probabilistic fitness model is developed and used to esti- 
mate the fitnesses of some of the offspring instead of expensive function evaluations. We analyze 
the scalability and speed-up of both the efficiency-enhancement techniques using facetwise models 
and verify them with empirical results. The results show that for additively-separable problems, the 
competent mutation operator scales as 0(2 k m 1 ' 5 ) — where, k is the building-block size, and m is the 
number of building blocks — and provides a speed-up of 0(y/k log m) over a selectorecombinative 
PMBGA. The results also show that for additively separable problems, by developing and using 
an endogenous probabilistic fitness model, only 1-10% of the population requires actual fitness 
evaluation, providing a speed-up of 2-50. 

Overall, the results in this paper demonstrate that the probabilistic model built in PMBGAs 
can be used in developing a scalable mutation operator and an effective fitness-estimation model, 
which can both provide significant efficiency enhancement and speed-up the GA process, while 
yielding high-quality solutions quickly, reliably and accurately. Additionally, since the speed-up 
provided by the two efficiency-enhancement techniques are nearly independent of each other, the 
combined speed-up obtained by using both simultaneously should be multiplicative of the individual 
speed-ups. 
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